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Abstract 

In Achlioptas processes, starting from an empty graph, in each step 
two potential edges are chosen uniformly at random, and using some rule 
one of them is selected and added to the evolving graph. Although the 
evolution of such 'local' modifications of the Erdos-Renyi random graph 
process has received considerable attention during the last decade, so far 
only rather simple rules are well understood. Indeed, the main focus has 
been on 'bounded-size' rules, where all component sizes larger than some 
constant B are treated the same way, and for more complex rules very 
few rigorous results are known. 

In this paper we study Achlioptas processes given by (unbounded) 
size rules such as the sum and product rules. Using a variant of the 
neighbourhood exploration process and branching process arguments we 
show that certain key statistics are tightly concentrated at least until the 
susceptibility (the expected size of the component containing a randomly 
chosen vertex) diverges. Our convergence result is most likely best possible 
for certain rules: in the later evolution the number of vertices in small 
components may not be concentrated. Furthermore, we believe that for 
a large class of rules the critical time where the susceptibility 'blows up' 
coincides with the percolation threshold. 



1 Introduction 

In 2000 Dimitris Achlioptas suggested a class of variants of the classical random 
graph process, now called Achlioptas processes. Such a process defines, for 
each n, a random sequence (Gi)i^o — {Gf')i^o of graphs with vertex set [n] as 
follows: start with an empty graph Go on n vertices. At each step i ^ 1, two 
potential edges ei and 62 are chosen independently and uniformly at random 
from all (2) possible edges (or from those edges not present in Gi-i). One 
of these edges is selected according to a rule TZ and added to the graph, so 
Gi = Gi-i U {e} for e = ei or 62- (As usual in combinatorics, we omit the 
dependence on n in the notation to avoid clutter.) Achlioptas processes are 
a special case of the more general class of £-vertex rules, where in each step 
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£ ^ 2 vertices vi, . . . ,vi are chosen independently and uniformly at random and 
then at least (usually exactly) one edge between these vertices is added. Always 
adding e = ei (or e = V1V2) gives the Erdos-Renyi random graph process, 
which has been extensively studied for more than 50 years; by now many of 
its properties are extremely well understood, in particular the evolution of the 
component structure, see e.g. [51 1^ [TT]. 

During the last decade the evolution of certain 'simple' Achlioptas processes 
has received considerable attention, mainly for so-called bounded-size rules, see 
e.g. [21 m ini [m Uni HSl m] . These make their decisions based only on the sizes 
of the components containing the endvertices of ei and 62, with the restriction 
that all sizes larger than some constant B are treated in the same way. For 
bounded-size rules a number of results have been established concerning con- 
centration of the number of vertices in components of fixed size [6l [21] , the size 
of the second largest component [TQ, the existence and location of the perco- 
lation phase transition where the (unique) linear size 'giant' component first 
emerges [SI HI], the 'order' of the phase transition [TB], and the rescaled size of 
the largest component [H] . For one particular bounded-size rule (a variant of a 
process suggested by Bohman and Frieze [4]), some finer details of the percola- 
tion phase transition have recently been investigated [31 [T^ [13] . The punchline 
of the work mentioned above is that bounded-size rules seem to show many qual- 
itative similarities with the 'classical' Erdos-Renyi random graph process; in the 
language of mathematical physics they appear to be in the same 'universality 
class'. 

In contrast, for more involved Achlioptas processes very few rigorous results 
are known, although these have been widely studied in recent years. This in 
particular applies to the class of 'unbounded'-size rules, usually simply called 
size rules, whose choices depend only on the sizes of the four components con- 
taining the endvertices of the two offered edges. To illustrate our very limited 
understanding of these, we mention that in one line of research, stimulated by a 
conjecture of Achlioptas, D'Souza and Spencer published in Science [T] (based 
on 'conclusive numerical evidence'), it was believed that certain size rules (in 
particular the product rule) could give rise to a discontinuous ('first order') 
phase transition. However, recently it was rigorously shown in [TH] [50] that the 
phase transition is in fact continuous for all Achlioptas processes (even for a 
larger class of processes). The surprises that (unbounded) size rules have shown 
so far indicate that our intuition for these processes still needs to be developed. 
In fact, obtaining non-trivial results for involved Achlioptas processes using e.g. 
the product rule is well known to be a technical challenge (see e.g. [Tl ITU]). 

Given a graph G, let Nk{G) denote the number of vertices of G in compo- 
nents of size k, and define the susceptibility of G as S{G) = kNk.{G)/n, so 
S(G) is the expected size of the component containing a randomly chosen ver- 
tex. Let Li{G) denote the number of vertices in the (a, if there is a tie) largest 
component of G. We say that tc = t^ is the percolation threshold for the process 
{Gf-) if for t <tc whp Li(Gt„) — o{n) while for t > tc whp Li{Gtn) — ^[n) (as 
usual, we henceforth ignore the irrelevant rounding to integers, and say that an 
event holds whp, if it holds with probability tending to 1 as n — > 00). The anal- 
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ysis for bounded-size rules in [51[T21[2I] uses S{Gi) as well as Nk{Gi) for fixed 
k as key statistics. In the 'subcritical' regime t <tc, they use Wormald's differ- 
ential equation method to establish the existence of functions Pk = Pk', 
s = such that Nk{Gtn) ~ Pk{t)n and S{Gtn) ~ s{t) hold whp. Based on 
this they show that tc = is given by the blow-up point of the susceptibil- 
ity: limt/^t^ s{t) — oo. As indicated by Spencer and Wormald [H], for general 
size rules the approximation of the key statistics using the differential equation 
method seems difficult. Another intricacy for such rules is that the dependencies 
among the selected edges often seem to be more complex in comparison with 
bounded-size rules, for which a large subset of the added edges can be thought 
of as chosen uniformly at random (essentially, when all four endvertices are in 
components of size larger than B); we return to this in Section [3l 

1.1 Main result 

In this paper we establish the first rigorous convergence result for Achlioptas 
processes using unbounded size rules such as the product rule: we show that the 
number of vertices in components of size fc ^ 1 (and the susceptibility) is tightly 
concentrated until the susceptibility 'blows up', which happens at a critical time 
t\j. In fact, our result holds for a very large class of Achlioptas-like processes, 
including essentially all Achlioptas processes studied so far (see Section [5] for 
the formal definition of £- vertex size rules). Here S'(G^) A oo as n — > oo means 
that for any C > we have P(S'(G^) ^ C) -J> as n -j' oo, and N^k{G) denotes 
the number of vertices of G in components of size at least k. 

Theorem 1. Let £ ^ 2 and let TZ be an i-vertex size rule. There exist th = 
G [l(I3T)jl] and functions {pk)k^i with pk — p^ : [0,tb) [0,1] such that 
the following holds. For every t ^ ty, we have 

S{Gt) ^ oo (1) 

as n ^ oo. For every t < t^, we have X]fe>i Pkit) = 1- Also, for every t < t^ 
there exist a, A,C > (depending only on TZ,£,t) .such that for every t' £ [0,t] 
we have Pk{t') ^ Ae^""^ for all k ^ 1. In addition, for n ^ ni){TZ,£,t) the 
following holds with probability at least 1 — n^^^ : for every ^ i ^ tn we have 

\Nk{Gf) - pk{i/n)n\ «C (lognfn'/^ for all k^l, (2) 
ISiGf) - Ek^lkPk{^/n)\ ^ {\ognfn~'/^, (3) 

and N^k{Gf) ^ Ae-''^n for all fc > 1. 

To interpret this result, we think of the functions Pk{t) as describing the 
'scaling limit' of the component size distribution at 'time' t < t\^, where time 
is the number of steps divided by n. A key aspect of the result is that this 
limit does not depend on n; in fact, most of our technical work is devoted to 
establishing this property - to show only that Nk{G^) is concentrated around 
its expectation, simpler arguments would suffice. The tail bound on pk given 
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in Theorem [T] states that the ideahzed component size distribution has an ex- 
ponential tail for t < as one would expect in a strictly sub-critical random 
graph. It implies that s{t) = J2k>i ^Pk{t) < oo if i < tb, so implies that 
that for t < tb we have 

S{Gt) A s{t) < oo, (4) 

where A denotes convergence in probability. The proof of Theorem [1] will show 
that s{th — e) > {£{i — l)e)~^, so the (idealized) susceptibility s{t) blows up 
at ib- The last statement of the theorem implies that for t < th we have 
Li(G^J ^ Bt log n whp, for some constant Bt that depends on t. Finally, we 
shall show in the Appendix that (unless TZ directly adds cycles to the graph), 
for t <% whp almost all components are trees, with the rest unicyclic. 

Theorem [T] allows us to say something about what happens at time t = th- 
Indeed, the definition of an £- vertex rule ensures that in one step, at most £ 
components are destroyed and at most £ (in fact at most £/2) are created, so 
l-^fc(G^j) — Nk{Gf-)\ ^ £k. It follows that each pk is Lipschitz continuous on 
[0, tb) with constant k£. Hence we can extend each pk continuously to the point 
^b , and ([2]) and the Lipschitz properties of and pk imply that 

Nk{GlJ/n^pM. (5) 

Together with Theorem[Tl the continuity results of [TS1I20] imply that Li{Gti^n) /n A 
0, and that Pk{th) = 1, so the numbers {pk{th))k^i do capture the asymp- 
totic component size distribution of G^^, although we do not have such tight 
error bounds as in ([2]). 

The proof of Theorem [T] is based on a variant of the neighbourhood ex- 
ploration process and relies on branching process (approximation) arguments. 
This is quite different from previous approaches in this area, which are based on 
the differential equation method; for certain (restricted) classes of rules these 
establish local convergence, i.e., that there exist functions pk — p^ ■ — > [0, 1] 
such that, for each fixed k ^ 1 and t ^ 0, we have Nk{G'^^)/n -4 Pk{t) as 
n — >■ oo. The limitations of these approaches are that they (i) only apply to 
certain bounded-size rules [H |2T] , or (ii) when applied to size rules need the ad- 
ditional assumption that certain systems of differential equations have unique 
solutions 19 , which is not known to hold for the product rule, for example. So, 
Theorem [T] establishes for the first time (a strong form of) local convergence for 
unbounded size rules (such as the product rule) until the susceptibility diverges. 
We believe that this convergence result is best possible: on the basis of heuris- 
tics and simulations presented in |17j we believe that there are certain natural 
size rules for which beyond t — a giant component emerges whose size is not 
concentrated. In these rules the numbers of vertices in components of each fixed 
size k are presumably also not concentrated after this point. 

Theorem [1] has some analogies with 'classical' percolation theory on, for 
example, the infinite lattice Z'', where there are two a priori different critical 
probabilities and pT. Intuitively, these correspond to the thresholds for (i) 
having (with positive probability) an infinite cluster and (ii) the expected cluster 
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size being infinite. For essentially all 'natural' lattices of interest it is nowadays 
known that pu = Pt (see e.g. [H [16]), but this fact is not at all obvious! Note 
that in the finite setting of this paper these two properties correspond to (i) 
having a linear size component, and (ii) diverging susceptibility. More formally, 
define = as the supremum of the set of i ^ for which Li{Gf^)/n 4 
as n — ^ oo, and tb — ^^'-^ supremum of the set of t ^ for which S{G^) 

is bounded in probability. Note that Li{G) < y^nSlG) implies ^ ^c- The 
remark after Theorem [T] entails that for size rules th is equal to the infimum of 
the set of t > for which ([T]) holds, and that S{G'^^) A oo. In fact, we believe 
that both thresholds coincide for size rules (analogous to the 'classical' case). 

Conjecture 2. Let £ ^ 2 and let TZ be an (.-vertex size rule. Then t^ = t^ . 
More precisely, for any t > and e > there exist 5,no > (depending only 
on TZ,£,t,e) such that P(Li(G^) > Sn) ^ 1 — e for n ^ tiq. 

Recall that Achlioptas processes (where the choice is between two edges) 
are a sub-class of 4-vertex rules. Conjecture [5] was proved for bounded-size 
Achlioptas processes by Spencer and Wormald [21], and for a subset of these 
processes by Bohman and Kravitz [6]. We shall show in Section [SU] that it holds 
for all bounded-size ^-vertex rules, as well as many other size rules, including 
the 'reverse product rule', for example. However, it does not hold for general 
vertex rules. Indeed, in Section [3.21 we show that modified size rules with one 
additional feature, namely that they may once switch their behaviour based 
on the number n of vertices and the number i of steps (or the value of the 
susceptibility), can delay the appearance of a linear size component for Q{n) 
steps beyond the point where the susceptibility diverges. 

2 Evolution of Achlioptas processes with an ini- 
tial graph 

In this paper we consider the evolution of Achlioptas processes starting with an 
initial graph F with vertex set V = [n\; we restrict our attention to £-vertex size 
rules TZ, whose decisions depend only on the sizes of the components containing 
the randomly chosen vertices. More precisely, each such rule TZ yields a random 
sequence (-F/^)i5so of graphs on V with F^^ — F. For every i ^ we draw 
i vertices Vi^i = (wi, . . . , vi) from V independently and uniformly at random, 
and then, writing Cj_|_]^ = (ci, . . . , q) for the sizes of the components containing 
vi, . . . ,V£in F^, we obtain obtain F^-^ by adding a non-empty set of edges Ei+i 
to F^, where TZ deterministically selects as a subset of all pairs between 
vertices in v^_^.l based only on the list of component sizes Cj^j^. Usually exactly 
one edge is added, but there is no reason to insist on this. 

When = Go is the empty graph on n vertices we obtain the 'standard' 
Achlioptas processes using £- vertex size rules TZ as defined in [TH] . As usual, we 
can allow for small variations in the above definition; this includes, for example, 
each time picking an ^-tuple of distinct vertices, or picking (the ends of) i/2 
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randomly selected (distinct) edges not already present in G^, see also [15]. For 
£ = 2 we thus recover the 'classical' Erdos-Renyi random graph process by 
always adding the pair viV2- In addition, our proofs can be written to allow TZ 
to make randomized decisions (with the probability of adding some set of edges 
depending only on c^_^_l), and, furthermore, to allow TZ to know which vertices 
in are in the same components of (for compatibility with jl8] we then 
require Ei+i ^ whenever all Vj are in distinct components, although nothing 
in the proof of Theorem [TJ except for the bound ^ 1, needs this). 

One difficulty in the proof of Theorem [1] is that there is a complicated de- 
pendence between the decisions of TZ in each round (and their order is also 
important). Indeed, changes can 'propagate' throughout the process: if the 
sizes of a few components are modified (e.g. by altering decisions of TZ or tuples 
V = (vi, . . . ,vi) offered), then this might change many future decisions of TZ, 
which in turn might alter further decisions, etc. To overcome this our proof 
proceeds by induction, always establishing concentration only for a small num- 
ber of steps; this is also the reason why we study the more general evolution 
starting from an initial graph F. Each time we rely on a two-round exposure 
argument: in the first round we reveal which tuples are selected, and in the 
second we then expose their order. For size rules not all tuples and components 
of F 'influence' the size of the component in Fj^ containing v: only those which 
can be reached from v after adding all pairs of each £-tuple to the graph (every 
rule only selects a subset of these pairs) . The key observation is now that given 
the corresponding 'relevant' tuples and components of F of the first round, the 
order of these tuples (exposed in the second round) determines the size of the 
component containing v. It turns out that if we only consider an rounds for a 
sufficiently small, then an exploration process determining these relevant tuples 
and components in the first round can be closely approximated by a subcritical 
branching process which is defined without reference to n. Since the out- 
come of the second round is a (random) function of the first one, it thus seems 
plausible that EiVfe(Fj^)/ n is independent of n (up to small error terms). In 
addition, since the first round is subcritical, this means that there typically are 
not too many tuples and components which influence the size of the compo- 
nent containing v. At least on an intuitive level this makes it plausible that it 
should be possible to establish concentration of Nk{F]^) around its expectation 
by applying McDiarmid's inequality. 

The rest of this paper is organized as follows. In the next section we state 
our main technical result (Theorem [3]), and then show in Section [221 how it im- 
plies Theorem [Tj Afterwards, in Section [2.3l we present some branching process 
preliminaries; these are used in Section [2^ where we establish Theorem [3] In 
Section [3] we discuss Conjecture [U giving examples of classes of size rules for 
which we can prove the conjecture, and examples of non-size rules for which 
it does not hold. Finally, in the appendix we consider the cycle structure of 
Achlioptas processes. 
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2.1 Main technical result 



Our main technical result establishes concentration during the evolution of 
Achlioptas processes starting with an initial graph F. The special case of 
an Erdos-Renyi evolution from an initial graph F (which can be seen as an 
evolving version of a special case of the inhomogeneous random graph model of 
Bollobas, Janson and Riordan jJl) has been previously studied by Spencer and 
Wormald i2l[ and Janson and Spencer [T^], the main focus being on the size of 
the largest component. In this context the susceptibility turns out to be the key 
parameter, and both papers use in essential ways that the Erdos-Renyi evolu- 
tion corresponds to the addition of uniform random edges (or pairs of vertices) . 
In contrast, when studying the evolution of Achlioptas processes, we need to 
deal with intricate dependencies between the edges added. 

Using susceptibility as a guide, we now briefly motivate the number of steps 
our result applies to. Suppose that, starting with F satisfying S{F) = L, we 
use the rule I which in each step joins all i random vertices by edges. Set 
s{t) = S{F^). If the sizes of the joined components are q, then (assuming that 
all components are distinct) the susceptibility changes by {J2 Ci)^ /n — J^^i /n = 
'Ylii^j CiCjjn. So, since the vertices of each tuple are chosen uniformly at random, 
it seems plausible that typicality we have s'if) w r&{S{Ff^j^^ — S{F^)) w 
£{e-l)s{t)'^. For t < [e{£~l)L]-^ = U this suggests s{t) w [1/ L - £{£ - l)t]-\ 
Since in each step any rule TZ only adds a subset of all (2) pairs to the graph, 
this indicates that the susceptibility does not 'blow up' as long as t < ts- The 
following result confirms this heuristic argument and shows that, under suitable 
conditions, for t < tg the number of vertices in components of size fc ^ 1 is also 
tightly concentrated (the function p intuitively results from an 'infinite' version 
of the rule TZ). Here we set xif) = J2k>i k'pik) and xiP^t) = J2k>i ^Pi^T't)^ 
and write a; = a ± & as shorthand for a; G [a — fe, a + 6] . 

Theorem 3. Let £ 2 and let TZ he an £-vertex size rule. Suppose /3 > 1, 
B > 0, L^l and ip :N ^ [0,1] satisfy 

Ek^Mk) - 1, (6) 

B, (7) 

X(^) L. (8) 

There is a function p : N x M+ — > [0, 1] (depending only on ip,TZ,£) such that 
for all a ^ satisfying 

a <[£{£- l)L]-^ (9) 

there exist P,B,L > 1 (depending only on £, L,a, /3, B) such that for every 
t Cz [0,(7] equations (jSl)-® hold when /3, B, L,ip{-) are replaced by B, L, p{-,t). 
If in addition F is a graph on n vertices luhich for C ^ satisfies 

NkiF) = ip{k)n ± (log nfn^^^ for allk^l, (10) 
Eke[n]NkiF)l3^ ^ Bn, (11) 
S{F) ^ L, (12) 
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then, setting C — C + 9, for n ^ nQ{£, L,a, /S, B,C) the following holds with 
probability at least 1 — rt^^'^°; for every ^ i ^ an we have 

S{F^) = x{p, ^/n) ± (log n)^n-i/2, (13) 
and equations (|10p - p2p holdwhen /3,B,L,C,F,ip{-) are replaced by /3, B, L,C, F^^, p{-,i/n). 

The proof of Theorem [3] is quite involved and is deferred to Section 12.41 It 
is useful to observe that since /3 > 1 holds, ([7]) and (fTTj) imply the tail bounds 
^^^{Ej^kVij),N^kiF)/n} ^ Bp-'' for all fc ;^ 1, so Li{F) = O(logn). By 
Theorem [3] analogous estimates also hold for Fj^ with i ^ an. In fact, for (ITT|) . 

to hold with 13,B,L,F replaced by B, L, F^, a minor modification of 
our proof shows that it suffices to assume ([5]), (ITT|) and only; for the special 
case £ — 2 this was established by Spencer and Wormald [H] under similar 
conditions. However, the key point of Theorem [3] is (|TOl) . i.e., that we obtain 
concentration of number of vertices in components of size k. 

Turning to the susceptibility, by combining (jlOp with the tail bounds fol- 
lowing from ([7]) and pip , for each fixed j we readily obtain rather precise 
estimates for Sj{F^) = X]fce[n] k^Nk{Fj^)/n with i < an, similar to Fur- 
thermore, since Li(Fj^) = O(logn) whp, we can easily use the differential 
equation method to make our heuristic discussion regarding the susceptibility 
rigorous, which e.g. yields x(PiO') ^ i^/L — t[t — l)""]"^ (for the special case 
£ — 2 this was noted by Bohman et. al [5]; it is also implicit in [21])- However, 
this crude bound, which follows from always connecting all £ vertices by edges 
in each step, is generally far from the truth; for this reason it does not suffice in 
our inductive application of Theorem [3l where we use the 'correct' value given 
by (ini). 

2.2 Proof of Theorem [D 

This section is devoted to the proof of Theorem [U which we establish by an 
inductive application of Theorem [3] each time we show concentration during a 
small number of steps (and maintain certain technical conditions), where the 
lengths of these intervals decrease as the susceptibility increases. This is also 
the main idea of the following rather technical construction: as we shall see in 
the proof of Lemma HI for each interval of length Aj it determines the scaling 
limits pfe (and certain tail bounds) in a way that does not depend on n. 

Wc inductively define a sequence (/3j, Bj, pj, A^-, Lj)j^o with f5j > 1, Bj > 0, 
Aj ^ 0, Lj ^ 1 and pj : NxM+ — > [0, 1], where the (3j are decreasing (Pj+i ^ /3j) 
and the Bj are increasing (Bj+i ^ Bj). In addition, for each j ^ the sequence 
satisfies the following invariant: for every t £ [0, A^] equations ([6|) and ^ hold 
with l3,B,ip{-) replaced by (3j,Bj, pj{-,t). We start by setting (3o ^ Bq = Lq = 
2, Ao = and defining po : N x R+ [0, 1] with po^l, 0) = 1 and po{k, t) ^ 
otherwise. Given j ^ 1, recall that x{Pj-i:t) — J2k^i kpj^i{k,t) and set 

=x(p,-i,A,_i)-Kl and A^^[£{£-l){L^ + l)]-\ (14) 
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Applying the first part of Theorem [3] with /3 = [3j-i, B — Bj-i, L = Lj and 
fik) = pj-i{k, Aj-i), we use the resulting p to define pj = p. Furthermore, 
by considering a — Aj we obtain /3,B and set /Sj = mm{/3, Pj-i} and Bj ~ 
max{_B, by Theorem [3] these satisfy the required invariant. Furthermore, 

it is not difficult to see that the entire sequence {j3j, Bj, pj, Aj, Lj)j-^o depends 
only onTZ,£. 

Next we combine the pj (each valid on an interval of length Aj) to form 
if(k,t), which will eventually be pkit) in Theorem[TJ this notation avoids con- 
fusion with the Pj used. For t ^ we define rt as the smallest r such that 
t ^ J2o<j<r ^3 ^^"^ set r = oo if no such r exists. For all (fc, t) e N x M+ set 

^{k,t) = ^-(^'^-So^.<n^.)' -^inKoo, ^^^^ 
I 0, otherwise. 

Transferring this definition to the invariant of the sequence introduced above, 
for alH ^ with < oo it follows that 

E,^i^(fc,t) = l, (16) 

and that for every t' e [0, t] we have 

Efc^i^(fc,0/?' (17) 

Now we are ready to prove the following concentration result, which also 
implies that in the previous construction we always have rt < oo if ([T]) fails. 

Lemma 4. Let € ^ 2 and let TZ be an £-vertex size rule. For every t ^ for 
which ^ fails we have rt < oo, and there exist a,A,C > (depending only on 
TZ,£,t) such that the following holds for n ^ no{TZ,£,t) with probability at least 
1 — n~^^ : for every ^ i ^ tn we have 

Nk{Gf) = ip{k,i/n)n±{\ognfn^''^ for all fc ^ 1, 
and N^kiGf) ^ Ae-^'^n for all k^l. 

Proof. Given t ^ 0, if ([1]) fails there exists £ > and an infinite subsequence n 
of N (depending only on TZ, £, t) satisfying 

P(5(G?|) < e-i) ^ e. (18) 

Let I = + 3 and K = \t£{l - l)L'\ + 1. Let to = 0, and for j ^ 1 let 



if tj^i > t, 
tj^i+Aj, otherwise. 
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For n ^ no{TZ,£,t) we inductively show that for every ^ j K, setting 
Cj = 9j + 2, with probability at least 1 — jn^^"", for every ^ i ^ tju we have 

NkiGf ) = ip{k,i/n)n±{\ogn)'^^n^/^ for aU fc ^ 1, (19) 
Eke[n]Nk{Gf )l3^ < B,n, (20) 
SiGf-) = x{^,^/n) ± (logn)^^n-i/2, (21) 

and for every 5j s ^ j with ts ^ t wc have 

x{^,ts)^x{Ps,As) <L-2. (22) 

Note that if tK < t, then substituting ^ into (HH) yields Aj > 1)L]-^ for 

all 1 ^ j ^ K . From K > t£{£— 1)L it thus follows that tx > t, a. contradiction. 
Thus ([22]) implies tx > t, i.e., < cxd. Recall that {(3j,Bj)j^i and K depend 
only on TZ, i and on TZ, £, t respectively. Hence the induction hypothesis for 
j = K implies Lemma IH where the tail bounds follow from ([20]) as /3a' > 1. 

For the base case j = we start with an empty graph on n vertices, and it is 
easy to see that (HHl-dlll) hold with (3^ = Bq = Cq = 2 and <^(fc,0) = po(fc,0), 
as defined above (ITU) . 

Given j ^ 1, for the induction step we may assume that tj-i ^ t (otherwise 
tj = tj-i, and there is nothing to prove). We first assume that G'^_^„ satisfies 
the induction hypothesis, i.e., (fT9l) -(|22 |) with j replaced by j — 1. In particular, 
(fTe]) and (dll) hold for t = tj^i with rt ^ j - 1, and we have S{G'^_^^) < 
Xi^,tj-i) + 1 = xiPj-ii Aj^i) + 1 ^ Lj for n ^ no{Cj-i). Now we condition 
on G^^ jn — F and, analogous as after (fT4|) . apply Theorem [3] with (3 — (3j-i, 
B = Bj-i, L = Lj, a = Aj, C = Cj-i and <p(fc) — Pj_i(fc, Aj_i) = ip[k,tj-i), 
which is possible by the induction hypothesis (and the properties established 
above). So, for n ^ no(^, ij, Aj, /3j_i, Cj_i), with probability at least 

1 - foj. gygj.y ^ i <; Ajn the graph Fj^ satisfies dS])-©, (ITU)) -([TT |) 

and (fT3)) when /?, C, F, </?(•) are replaced by /3, B, (7, i^/^, p{-,i/n), where C = 
Cj-i+9. Note that for size rules is exactly conditional on G^_^„ = i^. 

It is crucial that l3,B,G,p do not depend on the initial graph F, and that by 
construction p — pj, j3j ^ /3, Bj ^ B and Gj = G. So, by appealing to 
the induction hypothesis and recalling (fTS)) . it follows that with probability at 
least 1 — (j — 1)^^^"" — n^'^^'^ equations ([T9l) - ([2T|) hold. It remains to show 
that ([22l) holds. To this end recall that (|2T|) holds with probability at least 
1 — jn~^°° > 1 — e for all n ^ no{Gj, K,e). So, using that susceptibility is 
monotone increasing, by (|18l) it follows that for all ^ t' ^ mm{t,tj} we 
have x('/',t') < + 1 = L — 2, say. Now (l22t follows by combining the 
previous estimate with the observation that for every s ^ j with tg ^ t we have 
x{^,ts) — x(Ps, As). This completes the induction step. 

Finally, to see that n ^ nQ{Tl,£,t) suffices note that in each of the K steps 
we only used n ^ uq^I, Lj, Aj, /Sj^i, Bj-i,Cj-i,Cj , K, e), where Gj = 9j + 2 
and Lj, Aj , jSj^i, Bj^i depend only on TZ,£. This concludes the proof since e 
(and thus K) only depends on TZ, £, t. □ 
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Now we define tb = as the infimum of the set of t ^ for which ([T]) holds 
as n — 7> oo ; so (m fails for t < th- The remark after the proof of Lemma 4 in [18] 
implies that for t > 1 we whp have Li{G^) ^ cn for c = c{£,t) > 0, yielding 
SiC^J ^ [Li{GfJ\'^/n ^ c^n; so % ^ 1. Furthermore, for t < [£{£ - 1)]-^ an 
application of Theorem [3] to the empty graph F = on n vertices with a = t 
and L = 1 (similar as in the proof of Lemma|3]) readily shows S{G^„) ^ L whp, 
so t\y ^ [£{£— Now suppose that ([T]) fails for i = ib- The proof of Lemma|4] 
then shows that whp (IT51) ~(PT |) hold for i = t\^n, and that xifj^b) < L — 2. 
It follows that we can apply Theorem [3] with a — [i'^L]^^ and L ~ L; this 
implies S{G^^^^yJ ^ L whp, contradicting the definition of tb- So, since the 
susceptibility is monotone increasing, it follows that ([TJ holds for all t ^ th. 
Combining our findings, Lemma SI and ([T7)) now yield Theorem [T] with 
Pk{t) = (p{k,t). 

2.3 Branching processes preliminaries 

The following basic results for branching processes will be used in the proof 
of Theorem [31 They are similar to Theorems 3.2 and 3.3 in 21 , where they 
are attributed to much earlier results of Cramer. Given a non-negative integer 
valued random variable X, let Fx{z) — Ez"^ denote the (probability) generating 
function of X. Note that Fx{z) is convex and monotone increasing for z ^ 0. 

The first lemma essentially states that a two-generation branching process 
has (uniform) exponential tails provided that the generating function of each 
offspring distribution has radius of convergence strictly larger than one (and 
thus also exhibits exponential decay). 

Lemma 5. Let X,Y ^ be integer valued random variables with Fx (a) ^ A 
and Fy{P) ^ B, where a,/3 > 1. Let Z be the number of grandchildren in the 
two-generation branching process in which the root node has X children and then 
each child, independently, has Y children. There exists a > (depending only 
on a, /3,A,B) such that P(Z ^ s) ^ Ae^"'' for all s^O. 

Proof Pick C ^ max{S,2} such that a; = l-|-(a-l)(/3-l)/(C-l) s; /3. Using 
Fvil) = 1 and FviP) s$ S < C, convexity yields ^V(z) < [{z - 1)C + W - 
z)]/(/3 — 1) for all z E [1,/?]. So, by choice of x we have Fy{x) ^ a. Observing 
that Fz{z) — Fx{Fy{z)), using monotonicity we obtain Fz{x) = Fx{Fy{x)) < 
Fx{oi) ^ A. Since a; > 1 implies Fz{x) ^ P{Z ^ s)x* for every s ^ 0, we 
deduce P(Z ^ s) ^ Ae^"*' for a = log a; > 0, completing the proof. □ 

The second lemma is a standard result for subcritical Gallon- Watson branch- 
ing process: these exhibit (uniform) exponential decay if the offspring distribu- 
tion itself has (uniform) exponential tails. 

Lemma 6. Let Z ^ be an integer valued random variable with EZ ^ /i < 1 
and Fz{/3) ^ B, where /3 > 1. Let T be the total size of the Galton-Watson 
branching process in which each node, independently, has Z children. There 
exist 5 > 1 and D > (depending only on /3,B,fi) such that Ft{S) ^ D. 
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Proof. Let f{t) = E(e*(^-i)). Observe that /(O) = 1 and /'(O) = E{Z - 1) < 
/X - 1. As m the proof of Lemma [3 Fz(^) B for /3 > 1 yields P{Z ^ 
s) ^ Bf3~'^, which in turn readily implies that for some C = C{/3,B) we have 
f"{t) = E((Z- l)2e*(^-i)) ^ C for aU < t < (log/3)/2, say. So, using Taylor's 
theorem, forO s$ t (log /3)/2 we deduce /(i) l + {fi-l)t + Ct'^/2 = h{t). Let 
X = min{(log/?)/2, (1 — n)/C} > 0, and observe that c = max{/i(a;), 1/2} > 
satisfies f{x) ^ c < 1. Exploring the branching process tree as usual in breadth- 
first search order, we see that T > s implies -^i ^ "where the Zi are 
independent copies of Z (corresponding to the number of children of the i-th 
node). Now, using Markov's inequality and independence of the Z^, for every 
s ^ we obtain P(r > s) ^ E(e^(^^=i ^»))e-^'* = /{x)' ^ c". Finally, picking 
1 < (5 < 1/c, it follows that Ft{6) < D = D{6, c), as claimed. □ 

2.4 Proof of Theorem [3] 

The proof of Theorem[3]relies on a two-round exposure argument; we first reveal 
the random tuples selected, and afterwards expose their order of appearance. 
It will be convenient to work with a continuous-time random graph model, 
where the tuples arrive according to independent Poisson processes with 
rates l/n^~^. So tuples appear with rate n, and each tuple is chosen uniformly 
at random and independently of all previous choices. Let Ef denote the set 
of tuples which arrive in [0,^]; so \Et\ ^ Po{tn). Observe that for each tuple 
u € [nY the number Au{t) of its arrivals in [0, t] satisfies Ajit) ~ Po{t/n^~^), and 
that these random variables are independent for different tuples. Furthermore, 
writing x = t/n^~^ and using ^ 1 ~ x twice, note that for £ ^ 2 we have 

P(yl„(t) ^ 2) = 1 - e"^ - xe-^ ^ x(l - e-"=) <C t^/n'^. (23) 

Similarly 

P(A„(t) ^ 1) = 1 - s; a; = t/n'^~\ (24) 

Starting with F, for each tuple u^Etwe join all (2) pairs of vertices by edges, 
and we denote the resulting graph by Ht . We define H]^ as the graph which we 
obtain by starting with _F, and then presenting the tuples to TZ (together with 
the component sizes of the vertices) in a random order, always updating the 
graph according to the decisions of TZ (adding the pairs selected by Tl). Since 
conditioned on ji^tl = z we have i tuples chosen independently and uniformly at 
random, it follows that 

E(iVfc(i7,^) \ \Et\^i)^ ENk{F^). (25) 

Furthermore, mimicking the proof of Pittel's inequality (see e.g. [8j) for < 
tn < n^, a short calculation shows that for any graph property Q we have 

nFj^ ^ 2) < sVt^ ■ nHf- ^ Q). (26) 

In the following sections we always tacitly assume that the assumptions of 
Theorem [3] hold and consider t — t{n) satisfying 

< ^ (T ^ 1, (27) 
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where cr ^ 1 follows from Furthermore, unless stated otherwise, we will use 
the continuous-time random graph models Ht and H^. For later usage let 

C/=(logn)6/^ (28) 
2.4.1 Component exploration process for i = 2 

Our main ingredient for analyzing the first exposure round is a certain explo- 
ration process. Given a (random) vertex v, it finds all tuples in Et and com- 
ponents of F that are 'relevant' in the second exposure round for determining 
\Cv{HJ^)\, where we write Cv{G) for the set of vertices of G that are in the 
same component as v. As certain details are rather technical for Achlioptas 
processes, here we first outline some of the basic ideas and techniques for the 
simpler case of an Erdos-Renyi evolution starting from an initial graph F (in 
this special case similar ideas were used by Spencer and Wormald [H]). This 
formally corresponds to the special case £ — 2 and the rule which always adds 
the offered pair viV2 to the evolving graph; so Ht = Hf-. 

One major difference to the Erdos-Renyi case (where we start with an empty 
graph on n vertices) is that here we have two sources of edges: (i) the initial 
graph F and (ii) the random pairs in Et . As edges of type (i) are deterministic 
and those of type (ii) are random, our exploration process explicitly considers 
them separately. In the first round we start with a randomly chosen v and mark 
all u € Cv{F) as reached; all other vertices are unreached. In each later round 
we sequentially go through the vertices w reached in the previous round (the 
order does not matter here) and determine all its so far unreached neighbours 
u in Et (corresponding to pairs (ui,U2) G Et containing u and w), each time 
marking all u € Cu{F) as reached. Note that upon termination Cy{Ht) equals 
the set of all reached vertices. 

The previous procedure yields an associated 'exploration tree' Tv{Ht) in a 
rather natural way: loosely speaking, it is a child of w if u was 'reached' via w. 
With an eye to the upcoming analysis for size rules, here we already introduce 
different types of nodes: vertex nodes, component nodes, and root nodes. More 
precisely, we define %,{Flt) inductively as follows: it has a root node v, whose 
children are vertex nodes u € Cy{F). Then, given any vertex node w, each of 
its so far unreached neighbours u in Et yields a component node as a child, 
which in turn has vertex nodes u e Cu{F) as children. It follows that the set 
of all vertex nodes in Tv{Ht) equals Cv{Ht). The main point is that, even after 
ignoring all labels, the structure of Tv{Ht) is enough to determine \Cv{Ht)\. 

The key idea is now to approximate Tv{Ht) by an 'idealized' branching pro- 
cess, similar as in the 'classical' Erdos-Renyi case (exploiting, as usual, that by 
construction every edge is tested at most once). Recall that in Tv{Ht) already 
reached vertices are 'ignored'. So, noting that endpoints of random pairs in 
Et correspond to random vertices, and that each edge gives rise to two ordered 
tuples, it seems plausible that Ty{Ht) is dominated by (may be regarded as a 
subset of) a branching process l^^t where (ignoring for simplicity the root and 
all labels) every vertex node, independently, has Po(2i) component nodes as 
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children, each which in turn, independently, has A'' vertex node descendants, 
where A'' ^ |C„(-F')| for a randomly chosen vertex u. Now, using ([9]) and (|12p 
each vertex node has in expectation 2t ■ S{F) ^ 2<tL < 1 vertex nodes as grand- 
children, so we expect that 'Xy^t resembles a subcritical branching process which 
has O(logn) size with very high probability. From this it follows that Tv{Ht) 
and ly^t are both small and have similar offspring distribution (as not too many 
vertices are reached and thus ignored), so it seems plausible that we can couple 
them so that they agree whp. Note that T„_t still depends on n and the initial 
graph F. Define P(i? — k) = ^p{k), where (p is given by Theorem [31 The point 
is now that using (ITUl) it follows that R is very close to TV. So, denoting by 
X^^t the 'idealized' version of l^^t where we use R instead of N, the former 
considerations suggest that there is a coupling such that whp Ty^t = X^p^t holds 
(ignoring the labels of the vertices). To summarize, we just outlined that using 
the 'intermediate' process T„^t we can couple Tv{Ht) and X^p^t so that they typ- 
ically agree up to isomorphisms. Consequently, the distribution of \Cy{Ht)\ can 
be approximated using 3C,^.t, which does not depend on n or F. 

In the above construction and analysis we used in essential ways that in 
each round only one pair of vertices is chosen and connected by an edge. In 
contrast, when considering Achlioptas processes several vertices v = {vi, . . . ,Vi) 
are chosen in each round, and only a subset of the edges between these vertices 
is added to the evolving graph. Furthermore, in the second exposure round 
the order in which the tuples v are presented matters (as well as the order 
of the vertices in each tuple). This motivates the more involved exploration 
processes used in the next section, whose associated exploration tree captures 
more detailed structural information (also using more types of nodes). 

2.4.2 Component exploration process (the general case) 

In this section we consider the first exposure round, where the selected set of 
tuples Ft is revealed. Note that this defines Ht, which we obtain by starting 
with F and then joining all i vertices of each tuple in Et by edges. Using 
a natural variant of the standard neighbourhood exploration process, for any 
vertex v we can determine Cy{Ht) as follows. First we determine Cy{F), i.e., 
find all other vertices which are in the same component of as w. Then, for 
each w G Cv{F) we find all tuples u = (ui, . . . ,Ui) e Ft containing w, and 
repeat the same procedure (recursively) for each uj 7^ w, see Figure [TJ Observe 
that for determining Cv{Ht) it suffices to consider only those vertices Uj ^ w 
which we have not already reached in some previous exploration step. 

In the analysis it is easier to start with a random vertex v and break down 
the above exploration process into small steps, constructing an an associated 
exploration tree Tv,t — Tv,t{F). As we shall see, one key property of T^^t is that 
we can (typically) reconstruct the vertices and components which have been 
reached, as well as the tuples which have been 'tested' so far. The vertices of 
each exploration tree have different types: vertex nodes, component nodes and 
tuple nodes will represent vertices, components of F and ^-tuples, respectively. 
For technical reasons we also have root nodes and index nodes. We denote the 
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Figure 1: Example of the neighbourhood exploration process for £ = 3. It 
determines Cy{Ht) by first finding other vertices in the same component of F, 
then finding tuples containing them; afterwards it repeats this procedure for the 
new vertices in those tuples, and so on. 

vertex nodes of Tv^t by Vy,t- 

As mentioned above, our exploration starts with a random vertex v, which 
serves as the root node of T^^t, see Figure O Next we (deterministically) find all 
vertices w S Cy{F) and then add the vertex nodes w as children of the root. In 
the following we sequentially traverse each level containing vertex nodes (which 
essentially corresponds to a breadth first search). Given a vertex node w, we 
add £ index nodes wi,. . . ^wi as children, where Wj is an index node of type j. 
For each j — we sequentially test for the presence and multiplicity of 

all so far untested tuples u = {ui, . . . , ug) with Uj = w; we denote the resulting 
multiset of found tuples by Sj^w Now we sequentially traverse the u € 5^,^,. 
For each such u— {ui, . . . ,ui) we add a tuple node u and traverse the Ui with 
i ^ j sequentially. For each i ^ j, we add a component node Ui of type 
as a child of u, where ~ i for i < j and Xj{i) — i — 1 for i > j (so 

that the component nodes {ui} with i ^ j have types 1, ...,£— 1). If is 
already contained in Tv^t then we 'ignore' this component node. Otherwise we 
add vertex nodes w € C„. (F) as children of Ui, see Figure [2] Note that Cy{Ht) 
consists exactly of the union of all vertex nodes of Tv.t, so 

C^{Hf)(ZC,{Ht)=V,.t. (29) 

The main point is that whenever no component nodes are ignored, then 
from Tv,t we can reconstruct all explored tuples (in Et) and components (of 
F), which for size rules are the only ones relevant for determining the size of 
Cy{H]^). In fact, up to relabellings, we can reconstruct these tuples and the 
relevant component sizes of F without looking at the vertex labels (the tree 
structure, including the node types, is enough). Motivated by this we say that 
Sj^w is had if one of the following conditions hold: 

• Sj_w contains some tuple u — (wi, . . . , ug) multiple times. 

• Sj^w contains a tuple u = (ui, . . . ,ui) where Ui with i ^ j \s already a 
vertex node of Tv,t constructed so far. 
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Figure 2: Example of the exploration tree for £ — 3. The children of the 
root vertex v are w G Cv{F) (vertex nodes), which in turn each have children 
wi, W2, W3 (index nodes of types 1, 2, 3). Every Wj has all (so far untested) tuples 
u = {ui,U2tU^) G Et with Uj — w children (tuple nodes), whose descendants 
are component nodes Ui with i ^ j (of types 1,2). If is not already a vertex 
node of Tv,t, then its children are w G Cui{F) (vertex nodes), for which we 
repeat the above construction. 

• Sj^w contains a tuple u = (ui, . . . , ug) where Ui and Uk with i ^ k are in 
the same component of F (note that this holds for Ui £ Cw{F) for i ^ j). 

• Sj^w contains tuples u = (ui, . . . , itf) and y_= {vi, . . . ,vi) for which Ui and 
Vk with i,k j are in the same component of F. 

Otherwise Sj^w is good; Observe that if Sj,w is good, then in Tv^t none of w's 
component node descendants Ui with u — (ui, . . . , Ui) e Sj^^ are ignored. For 
this reason we call 7'v,t good if every Sj_w is good. In the following we estimate 
the probability that Sj.w is bad. Clearly, there are at most n^~^ different tuples 
with Uj = w. Recalling that Vv,t denotes the vertex nodes of Tv,t, there are at 
most £n^~'^\Vv,t\ different tuples satisfying the second condition, and at most 
£'^n^~'^\Li{F)\ tuples to which the third condition applies. Similarly, there are at 
most £'^n?^^~'^^'^^\Li{F)\ pairs of tuples which satisfy the last condition. Recall 
that the random variables Au[t), which count the number of times u is in Et, 
are independent for different tuples u. So, using (|23|) . ([M)) and t ^ 1, whenever 
max{|V„^(|, |Li(F)|} ^ U holds we see that the probability of Sj^w being bad is 
at most 

. t^n' + 2£'n'-'U ■ t/n'-' + £^n''-'U ■ t'/n^^'-'^ ^ A£'U/n. (30) 

To understand the structural properties of Tv^t it will be useful to compare 
it with a closely related process that is simpler to analyze. Recall that when 
determining the Sj^w we only consider so far untested tuples. Thus each Sj^w 
is dominated (with respect to the subset relation) by 5*^,^1 j where for each of 
the n^~^ tuples u = [ui, . . . ,ui) with Uj = w, independently, the number of 
its arrivals is given by a Po{t/n^^^) distribution. There is a natural coupling 
between Sj^w and Sj^u, which only fails if Sj^^u contains u which are forbidden 
for Sj^w Since each of these 'bad' tuples contains at least one vertex from 
Vv,t, there are at most £\Vv.t\n^~'^ of them. So, with ([24|) and i ^ 1 in mind. 
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by considering the probability that Sj,w selects at least one of them, whenever 
\Vv.t\ ^ U holds it follows that 

dTv{Sj^w , Sj^w) ^ i\Vy^t\n'-^ ■ t/n'-^ < iU/n. (31) 

We now define T^^t = T^^t(i^) similarly to %_t: we employ the same construction 
except that we use (independent copies of) Sj^^ instead of Sj,^ and always 
proceed as if Sj^ru is good. Since each Sj^w is dominated by (may be regarded 
as a subset of) Sj^w, it follows that Tv,t is dominated by 1v,t with respect to 
the subgraph relation. Denoting the set of vertex nodes of l^^t by Vv,t, we see 
that Vv,t is dominated by '^v.t- 

The next lemma states that the number of vertex nodes in Tv,t and 1v,t have 
(uniform) exponential decay. 

Lemma 7. Suppose that ^ and pT|) - P^ hold with /? > 1. There exist a,A> 
(depending only on £, L, a, (3, B ) such that for all !^ t ^ a and s ^ 
we have P{\Vv.t\ > «) < P(|23„,t| ^ s) ^ Ae-""" , EiV^,(i/f ) ^ yle^^'n and 
F{Li{Hj^) ^ s) Ae-^'n. 

Before giving the proof of this result, which is based on branching processes 
arguments, we use it to show that Ty^t and %v,t can be coupled so that they 
typically agree. Note that at distance Ai + 1, 4i + 2, 4i + 3, 4j + 4 from the root 
Ty^t and 1y,t always have vertex, index, tuple and component nodes. 

Lemma 8. Suppose n ^ no{£, L, a, j3, B) and that the assumptions of Theorem\^ 
as well as (|27p hold. There exists a coupling of Ty^t and %y^t so that with 
probability at least 1 — (log n)^/n we have Ty t = T„ j and Tu^t is good. 

Proof. We write for the restriction of a rooted tree T to all vertices within 
distance at most i from the root. Let V* j and QJJ, ^ denote the vertex nodes in 
and J, respectively. Recall that U — (logn)^/^. Since /? > 1, note that 
LiIf) < f7 follows from ([11|) for n > no(B,/3). 

We inductively couple Ty l'^^ and T^'^^ for ^ i ^ J7 so that with probability 
at least 1-i- b^U^/n we have either max{|V;^;t+^|, \'^y^^\) ^ U, or 7;'^*+^ = 
ly^f^ with all Sj^yj of Ty^t^^ being good. The base case i = is straightforward, 
as both use the same procedure for generating the root and its children. Now 
suppose that we have constructed Ty^t^^ and 1^*^^ coupled as above. In the 
following we sequentially consider vertex nodes w at distance 4i + 1 from the root 
and extend the coupling to their descendants with distance up to 4(i+l) + l; here 
we clearly may assume | V^'j*"^ | — |2J^'t^^ | < U . For each vertex node w we create 
(. index nodes wi, . . .wi (of types 1, ...,£) . We abandon our coupling whenever 
we have found more than U vertex nodes (in which case we are done), so pip 
holds. Thus we can couple Sj,y, and Sj^w so that they agree with probability 
at least 1 — IV jn. Now we also abandon our coupling whenever Sj^w is bad, 
which happens with probability at most Ai'^U/n by ((30)) . The point is that given 
good Sj^w — Sj,y], in both cases the same deterministic construction is used for 
generating the descendants of Wj with distance up to 4(i + 1) + 1 from the 
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root. So, by repeating this for wi, . . . , wi, with probabihty at least 1 — M'^U /n 
we can couple the descendants of w with distance up to 4(i + 1) + 1 from the 
root. Since we follow this argument for each of the at most U vertex nodes at 
distance Ai from the root, we see that we can extend our coupling to ^ 

and T^^j'*'^'''''^ with probability at least 1 — 5£^[/^/n, establishing the claim. 

Finally, by Lemma [7] we know that V — max{|V«,t|, |5J„,t|} < C//10 holds 
with probability at least, say, 1 — for n ^ no(a, A). This together with the 
above coupling completes the proof (as there are no vertex nodes with distance 
larger than AV + 1 from the root). □ 

We now introduce an idealized 'infinite' version X^^t of the exploration tree 
that is defined without reference to n or _F, and in which 'bad' things (such as 
'ignored' component nodes) cannot happen by definition. Let R be the random 
variable with P(i? — k) = ip{k) for each fc ^ 1, where cp is given by Theorem |3l 
We start X^^t with a root node and add R vertex nodes as children. Then, given 
any vertex node, we deterministically create £ children (index nodes of types 
!,...,£). Each of these, independently, has Z ~ Po{t) children (tuple nodes). 
For each of these grandchildren we assign again (deterministically) i—1 children 
(component nodes of types 1, 1). All of these, independently, give birth 

to R many descendants (vertex nodes). 

For our subsequent analysis it will be key to observe that if we are only 
interested in equality up to isomorphisms, then we can generate 1y,t in a more 
convenient way, similarly to Xi^,t. Indeed, using standard properties of Poisson 
processes and noting that selecting a uniform tuple u = (ui, . . . , ue) with Uj = w 
is equivalent to picking £ — 1 random vertices, we can generate the descendants 
of Wj constructed by Sj^ui using the following three-generation tree process: 
the root has Z ~ Po{t) children (tuple nodes); then for each of the resulting 
children we construct (deterministically) £ — 1 grandchildren (component nodes 
of types !,...,£— 1), which each in turn give birth to N descendants (vertex 
nodes), where N ~ |Ctj(i^ )| for a uniformly and independently chosen vertex u. 
Comparing the resulting construction with X^p^t, it follows that we can generate 
Tt,_t up to relabellings in the same way as X^^t, with the only difference that we 
use N instead of R. 

Proof of Lemma^ Since Vv,t is dominated by (may be regarded as a subset of) 
^v,t, we have P(|V^,t| ^ s) ^ P(|QJ^^t| ^ s). Using this inequality, we claim 
that it is enough to prove existence of a, A > (depending only on £, L, a, /?, B) 
satisfying 

P(|9J^,,t| ^ s) < Ae-"" for all s ^ 0. (32) 

Indeed, recall that v is chosen uniformly at random, so that ¥{\Cv{H]^)\ ^ 
s I = G) = N-^s{G)/n. Taking expectations, we see that V.N-^s^H'^) = 
nP(|a(/ft^)| ^ s). "using dSSl) we have \C,{HT^)\ ^ |V„,t|, so P(|d(i?f )| ^ 
s) ^ Ae""* by (1211). Now Markov's inequality gives ¥{Li{Hf') ^ s) Ae-'^^'n. 

In the remainder we establish p2p using Lemmas [S] and [51 Let Zj be in- 
dependent copies of Z ^ Po{t), and let Vj^r,k be uniformly and independently 
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chosen random vertices. We henceforth construct T^_t up to relabelhngs, as 
described in the paragraph proceeding this proof. Given a vertex node w with 
distance Ai + 1 from the root, in this tree construction it has 

E E E 

vertex node descendants at distance A{i + 1) + 1 from the root, where E(VF) = 
et{£ - 1)S{F) ^ ea{i -l)L < 1 due to t ^ ct and ©. Note that Fw{z) = 
[Fz{[Fn{z)Y~^)Y, where N ^ \Cu{F)\ for a uniformly chosen vertex u. By pTI) 
we have [Fn{P)Y~'^ < B'^'^. Now, since Z - Po(t) and < i < cr, it easily 
follows that Fz{z) = e*(^-i) e"^ for z ^ 0, so FwW) ^ B ^ B{e,a,B). Let 
be the size of the Galton-Watson branching process in which each node, 
independently, has W children. Lemma [5] yields Fiy+{S) ^ D, where 6 > 1 and 
D > depend only on £, L, a, /3, B. Since the distribution of W does not depend 
on the ui or i considered above, it in particular follows that each vertex node 
with distance 1 from the root has vertex node descendants in 'Zy^t- 

Finally, note that T^^ t starts with a root vertex which gives birth to N vertex 
node children, each of whose vertex nodes descendants is given by independent 
copies of W^. With this in mind |2J^,t| ^ T, where T is a two-generation 
branching process where the root has N children, and then each of these, in- 
dependently, has children. Recall that F]\f{(3) ^ B and Fiy+{S) ^ D for 
/3, 5 > 1 and B,D > 0. So, Lemma [5] yields for A = i? and a > depending 
only on /3, 5, B, D. As explained, this completes the proof. □ 

Recall that X^_t uses the same construction as 1v,t, with the difference that 
it employs R instead of N. When establishing the exponential decay in the 
proof of Lcmma[7l note that the only properties of N used are KN = S{F) ^ L 
and FnW) ^ B. Since ER = xiv) ^ L and B hy dH)-®, the same 

argument thus carries over word-by-word when applied to the vertex nodes of 
which wc denote by Q3,^_t. 

Lemma 9. Suppose that ([6]) -([8]) and ([9]) hold with f3 > 1. There exist a,A>0 
(depending only on £, L, a, l3, B) such that for all ^ t ^ a and s ^ we 
have P(|QJ^_t| ^ s) ^ Ae^"^" , where a, A are defined in the same way as in 
Lemma^ □ 

After these preparations, we are now ready to show that we can couple Tv t 
and X^^t so that they typically agree up to isomorphisms (by using as an 
'intermediate' process). 

Lemma 10. Suppose n ^ nQ{£, L,a, (3, B) and that the assumptions of Theo- 
rem\^as well as (j27l) hold. There exists a coupling ofTv,t o,nd X^_t so that with 
probability at least 1 — (log n)'-^+^n~^/^ we have Tv.t — X^^t o,nd Tl.t is good. 

Proof. Recall that [/ = (logn)^/^. By Lemma|8]it suffices to couple T^^t and Xip_t 
so that with probability at least 1 — (log n)^n^^^^ we have Tt,^t = X^j. 

To this end we use a similar but simpler argument as in the proof of Lemma [51 
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inductively extending our coupling from distance 4z + 1 to 4{i + 1) + 1 from 
the root. As before, using Lemma [7] and |9] we can safely abandon our coupling 
whenever we have seen at least U vertex nodes, or when we reach distance 
U from the root. In the inductive step, the only difference between X^p^t and 
Iv^t is that Xip,t uses R whereas Tt,_t uses TV. Recall that P{R = k) = (p{k) 
and F{N = k) = Nk{F)/n. It is not difficuh to see that ^ and ^ imply 
P(i? ^ [/) s; and P(7V ^ [/) = for n ^ no{l3,B). Using these tail 
estimates together with ([TU]), by distinguishing values smaller and larger than 
U we obtain 

dTv{R,N) ^ [/• (logn)'^n^i/2 2f/(logn)'^n~i/^ (33) 

We furthermore may safely abandon our coupling whenever some index node 
has Z ^ U children, since (using t ^ a) this occurs with probability at most 
for n ^ no(cr). The point is that this ensures that we only need to couple R 
and N at most £^t/^ times when going from distance 4i + 1 to 4(« + !) + !. So, 
each time we can extend the coupling inductively with probability at least, say, 
1 — 3£^U^(}ogn)'^n^^^^. Arguing as in the proof of Lemma |51 this completes 
the coupling argument. □ 

2.4.3 Expected component sizes 

After analyzing the tuple and component structure induced by Ef, we now 
consider the second exposure round, where the selected tuples are presented 
in random order to TZ. Intuitively, the coupling given by Lemma [TU] allows us 
to estimate ENk{H^) using X^pt- As we shall see, this also carries over to 

Recall that if the exploration tree Tv.t = T is good, then during its construc- 
tion no component nodes are ignored. As mentioned in Section 12.4. 2[ the key 
point is that if no nodes are ignored (i.e., all component nodes have at least 
one child), then from the structure of T (which includes the vertex types) we 
can reconstruct all tuples in Et and component sizes of F (up to relabellings) 
which are relevant for determining \Cy{H^)\. We denote the corresponding set 
of tuples and component sizes by Tr and Ct, respectively. As the above 'recon- 
struction' procedure only uses the tree-structure of T, it in fact can be applied 
to any exploration tree in which each component node has at least one child; 
so, in particular, to Xip,t = T. In the following we define |C^(T)| for any ex- 
ploration tree T, where we formally set |C''^(T)| = if T contains a component 
node with descendants. Otherwise, we traverse in (uniform) random order the 
tuples in Tr] for each tuple we present the component sizes of its vertices to TZ 
and update the list of components (and their sizes) according to the decisions of 
TZ (by adding the pairs selected by TZ). Finally, we define \C'^{T)\ as the size of 
the resulting component which contains the root vertex of T. Since the second 
exposure round of presents the tuples in Et to TZ in random order, a mo- 
ment's thought reveals that conditional on Tv,t — T being good, both \Cv{H^)\ 
and \C''^{Tv,t)\ have exactly the same distribution for size rules. So, for all /c > 1 
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we have 

n\C-o{H^)\ =k\%,t^T is good) - P(|C^(X,t)| = fc I r„.t = T is good). 

(34) 

Before using this observation to estimate E,Nk{Hj^), we first collect some basic 
properties of the function p, where we set 

p{k, t) = V{\C'^{X^^t)\ = k) for aU (fc, t) e N x IR+. (35) 

Lemma 11. Suppose that ([HI)-® and (O hold with /3 > 1. The function p : Nx 
R"*" — [0, 1] defined in (j35p depends only on ip,TZ,£ and satisfies X]fc>i P(^i^) = 
1 for all ^ t ^ a. Furthermore, there exist a,A > (depending only on 
£^ L,a, l3, B) such that for all ^ t ^ a and s ^ we have p{s,t) ^ Ae""*, 
where a, A are given by Lemma\^ 

Proof. The definitions of C^(-) and of X^^t depend only on 72., £ and on Lp, TZ, i, t 
respectively. So, from ([55]) we see that p : N x K+ — > [0,1] depends only on 
V?, TZ, £. Since the component containing the root vertex of X^^t can only contain 
vertex nodes of X^^t, we see that 1 ^ \C'^{X^^t)\ ^ |2?ip,t| holds, from which 
p{0,t) = follows. Furthermore, Lemma |9] implies p{s,t) ^ P(|QJ^_t| s) ^ 
Ae~°'^ for all s ^ 1, where a,A> depend only on £,L,a, j3,B. Similarly, for 
ah s ^ we have P(X^,t is infinite) P(|93>^,t| ^ s) Ae''"' . But Ae-"' 
as s — > oo, so P(Xi^_t is infinite) = 0, which in turn yields X]fc>i = 1- 1^ 

Lemma 12. Suppose n ^ no{£, L,a, (3, B) and that the assumptions of Theo- 
rem\^ as well as (1271) hold. We have 

EiVfe(i7f ) = p{k,t)n± {lognf+^n^/"^ for all fc ^ 1. (36) 

Proof. Similar as in the proof of Lemma [3 since v is chosen uniformly at ran- 
dom we have ENk{H^) = nF{\C^{H^)\ = fc). To prove the claim it thus 
suffices to relate P(|C„(ff4^)| = fc) and p{k,t) = P(|C^(3e<^,t)l = k). The cou- 
pling of Lemma [TU] implies that X^p^t — Tv.t holds with probability at least 
1 - (logn)C'+5n-i/2 foj. „ ^ nQ{£, L,a, 13, B). Hence 

n\C'^irv,t)\ = fc) = P(|C^(X^,OI - fc) ± 2(logn)^+5n-i/2. 
Since this coupling also implies that Tv.t is good, using (p4|) it follows that 

P(|a(i/r)| = fc) = P(|C^(r.,t)| = fc) ± 2i\ognf+^n-^/\ 
Finally, combining our findings and recalling (j35p . we readily obtain (1361) . □ 

Now we relate with by establishing that E7Vfe(i7f ) w EiVfe(i^t^). 
Lemma 13. Suppose that ^ t ^ I. Then for n ^ no{£) we have 

ENk{F^) = ENk{Hj^) ± fc(logn)ni/2 for allk^l. (37) 
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Proof. Observe that Nk changes by at most ik per step. So, for r ^ s we have 
E(7Vfe(Fj^) I = G) = Nk{G) ±{s- r)£k. Taking expectations and restricting 
our attention to r E {tn — i, tn} shows that for each i ^ we have 

ENk{F^^,) = ENkiF^J ± eki. (38) 

Set s — \ogn. Using t ^ 1, standard Chernoff bounds yield that \Et\ = 
tn± s with probabihty at least 1 — for n ^ ng. Combining this with (|25[l 
and ([38|) . we readily obtain 

ENk{H^) = ENk{F^) ±£ks±n- n'"^ , 

which implies ([37]) for n nQ{£), with room to spare. □ 

2.4.4 Concentration of component sizes 

In this section we establish concentration of Nk{F^) around its expected value. 
The main technical difficulty here is that few changes of the offered tuples might 
alter many decisions of size rules (as the component sizes observed in later 
rounds can change); as we shall see, the bounds for ii(-) implied by Lemma[7] 
will be a crucial ingredient for showing that this is typically not the case. 

Lemma 14. Suppose n ^ nQ{£, L, a, jS, B) and that the assumptions of The- 
orem hold. With probability at least 1 — n^^^*', for every ^ i ^ an we 
have 

Nk{F^) ^ENk{F^)±{lognfn^/^ /or aZn < fc < (log n) 2. (39) 

Proof. We sequentially draw cm random tuples and consider two associated 
graph sequences F^ and F^ , where the 'influence' rule I in each step simply 
joins all £ randomly chosen vertices by edges. Note that F^ C F^ always holds. 
Let C denote the event that _Li(FJ„) < U = (logn)^/^, which by monotonicity 
implies Li{F^) < J7 for all ^ i ^ cm. Combining Lemma [7] with ()26p . for 
n ^ no (a, A, a) we have, say, 

P(-/:) < 3v^ • f{Li{Hl) ^ [/) s$ (40) 

For every 1 ^ z ^ an let X^^i denote the number of vertices which satisfy 
\Cy{F]^)\ = k and \Cv{F'[)\ < U. When C holds no vertices are 'ignored' due 
to \C^{Ff)\ ^ [/, so we have X^.i = Nk{F^). Together with ^ this readily 
gives, say, EXk,i = ENk{F^) ± n'^. So, for A = U^^^n^^^ it follows that 

¥{{\Nk{F^) - ENk{F^)\ ^ 2A} n £) P{\Xk,^ - EXk^,\ > A). (41) 

Note that for every size rule TZ the random variable Xk,i can be written as 
Xk^i = f(v.i, . . . ,v^, where the denote the i?-tuples generated by the ^-vertex 
process in each step (uniformly and independently). We claim that the function 
/ satisfies \ f{uj) — f{u})\ ^ MU whenever w and (D differ in one coordinate, i.e., in 
one tuple. Assuming that (jZi , . . . , yield F/^ and F^, respectively, let F/^ and 
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Ff denote the graphs which result by changing to . Since Ff and Ff only 
differ in the edges induced by v_j and y_j , there is a set of vertices W containing at 
most 2t components in each of F^ and Ff so that outside of W the component 
structure of both graphs is the same (to see this note that the order is irrelevant 
for T, so we may assume i = j; then defining W as the union of the components 
containing the vertices of Vj and Vj in F^_i — F^__i suffices). The key point is 
now that for size rules the decisions of TZ in F^ and F^ are the same for all 
tuples which contain no vertices from W (indeed, if a decision of TZ is modified 
then any changes of the resulting component sizes can only 'propagate' inside 
the components of F^ and F^; so only tuples containing vertices from W can 
be affected). It follows that the component structure outside of W is also the 
same in F^ and Fj^. Recall that W contains at most 2£ components in each of 
F^ and F^. So, since Xk^i only counts those vertices v with \Cv{F^)\ < U, we 
see that a change of one tuple can alter / by at most 2 • 2£ • [/, as claimed. So, 
recalling that 1 ^ i ^ an, for n ^ nQ{i,a) McDiarmid's inequality [15j implies 

/ 2A2 \ 

P(|Xfc,, - EXkA ^ A) exp (^-^^\ <; n-300 . (42) 



Finally, after combining (|40l) - p2)) . taking a union bound to account for all 
choices of 1 ^ i ^ crn and 1 ^ fc ^ (logn)^ completes the proof (noting that the 
claim is trivial for 1 = 0). □ 

Using the main idea of the above proof we can directly show that EiVfc(_F/^) 
is essentially independent of the initial graph F^ — F for i ^ an: for any two 
graphs F, F satisfying the assumptions of Theorem [3] their expected values can 
differ by at most, say, {\ogn)'~^~^^n^^^ . The key point is that for such graphs 
we can construct a bijection ^ between their vertex sets which, up to an excep- 
tional set W of at most, say, 4J7(logrt)'^n^/^ vertices, preserves the component 
structure of F and F, respectively. Now, using we couple F^, Ff and F]^, F^ 
in a measure preserving way. Since changes can only propagate inside the com- 
ponents of the 'influence' graphs, only those vertices whose components in F^ 
or F^ contain vertices of W or ^(W^) can be 'spoiled'. Intuitively, since the 
components usually have size at most U, under this coupling Nk thus typically 
differs by at most 2\W\ ■ U for both graphs. Taking the error probability of 
max{Li(i^^„), Li{F^^)} < U into account, the claim now follows without much 
work. 



2.4.5 Putting things together 

In this section we combine our findings to prove Theorem [3] Lemma [TT] easily 
implies the first part, i.e., existence of p : N x IR+ — )• [0,1] with the desired 
properties. Indeed, it ensures that for every cr ^ satisfying © there exist 
a, A > (depending only on L, cr, /3, B) such that for every t e [0, a] we have 
Efc>iP(fc,0 = 1 and 

p(s, t) Ae-"' for ah s > 0. (43) 
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For later usage we remark that Lemma [7] holds for the same a, A. Let D = 
300/a > and ^ = min{e''/2, e^/^''^)} > 1. Now, usmg (gS]) we see that 

5]p(fc,t)^^<A^e-«^/2 = B-l, (44) 

with 1 < i? < oo depending only on a, A. Similarly, we obtain 

X{p, t)=J2 kp{k, t)^AY, ke--'^ = 1-1, (45) 

with 1 < L < oo depending only on a, A. Summarizing, equations (jH])-® hold 
when /?, B, L, ip{-) are replaced by /3, -B, L, p(-, t), with room to spare. 

Turning to properties of F]^ , from Lemmas [T2HT41 it follows that with prob- 
ability at least 1 — n^^^*', for every < i < an (by considering t = i/n G [0, a]) 
we have 

NkiFj^) = p{k,i/n)n±3{\ognf+'^n^/^ for ah 1 «C fc ^ (logn)^ (46) 

for n ^ nQ{£, L, <t, f3, B). Recall that Lemma[7]holds with the a, A chosen above. 
By definition of D it follows that, with probability at least 1 — n~^^°, we have 

L^{Ff^) Dlogn (47) 

for n ^ n[){A). In the remainder we assume that (|15)) - (H71) hold. Recalling (|43p 
and the definition of D, note that for all k ^ Dlogn and ^ i ^ cm we have, 
say, \p{k,i/n)\ ^ for n ^ ni^{A). Using (|47| it follows that 

NkiF^) = p{k,i/n)n± (logn)'^^^/^ for all fc ^ D log n and s$ i s$ an. 

Together with for every i ^ an this establishes ([TU| with C, F, (^(•) 

replaced by C,F^, p{-,i/n) for n > no{D), where C" = C + 9. Now, using (^i]! 
and (|^ - (|T7)) we see that for every ^ i ^ crn we have (as i/n e [0, cr]) 

^ iVfe(^;^)/3'= p{k,i/n)p' + 3D{lognf+''n^/^ ^ /3*'- 

^ n^p(/c,i/n)/3'' + 3i:»^(logn)'^+^n^/^ < Bn 

k^l 

for n > no(C, £>), which establishes ([TTl) with (3,B,F replaced by ^,B,F^. 
It remains to show that (ini)-([T3]) hold. Recall that S{G) = J2k&ln] kNu{G)/n. 
Now, assuming n ^ 7io(aj -D) and using (P5)) - (|T7)) similarly as above, for every 
^ i ^ an we have 

S{F]^)^Ykp{k,i/n)±A ^ ke-"^ ±iD'^{\ognf+^n-^'^ 

k^l fc^Dlogn 

= x{p,i/n)±4:D'^{\ognf+^n~^^^, 

which establishes (|13p for n ^ no{D). Finally, recalling (|45|) . it follows that (|12p 
holds with L, F replaced by L, F^ for n ^ no(C, I?), which completes the proof 
of Theorem [31 
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3 When does % = tc? 



In this section we discuss Conjecture [2l first showing that it does hold for many 
size rules and then, in Section[3?2l showing that it cannot be extended to general 
^-vertex rules, i.e., that the critical point where the susceptibility blows up need 
not always coincide with the percolation threshold. 



3.1 Rules with uniform random edges 

It is well known, and not hard to check, that under suitable assumptions the 
graph Fgn given by adding 9n independent and uniformly random edges to a 
given n- vertex initial graph F can be viewed as an instance of the inhomogeneous 
random graph model of Bollobas, Janson and Riordan [7]. To make this precise, 
consider instead the (multi-)graph Fg obtained from F by adding a Poisson 
number Po{29/n) of copies of each of the (2) possible edges, with these numbers 
independent; we may then ignore multiple edges, as we are only interested in 
the component structure. Since Po(6'(n— 1)) edges are added in total, and there 
will be few multiple edges, Fg and Fgn are essentially interchangeable (one may 
use domination arguments comparing them for different 9 to make this precise). 
Given two components Ci and C2 of F, the number of edges between them in 
Fg is Poisson with mean |Ci||C2|26'/n. Making (for a change) the n dependence 
explicit, let iJ„ be the random graph whose vertices are the components of F, 
with an edge between two vertices if these components are joined by an edge of 
Fg. We say that a vertex of Hn has type k if the corresponding component of 
F has k vertices. Then the probability of an edge between a given type-« vertex 
and a given type-j vertex of Hn is 1 — e~^**^/", which is around 29ij/n if i and j 
are not too big, and the events that different edges are present are independent. 

More precisely, let K(i,j) = 29ij for all positive integers i and j. Suppose 
that /i is a finite measure on Z"*", i.e., that fik = fi{{k}) ^ for all k and 
< '^f.yj^ fik < 00. Let = be a random n- vertex starting graph. Suppose 
that, for each fixed fc ^ 1, 

as n — )■ 00, i.e., that i?„ has asymptotically /ifeji vertices of type fc, and that 



J2k^ik = l- (49) 



Then one can use (|^^ . the fact that Fn has n vertices and to show that 
whenever K(n) — >■ 00 we have 

N^K(n){Fn)/n^O, (50) 

and it follows that for any A C Z+ we have 

E^^Ea'- (51) 
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In the terminology of [7], this means that the (random) sets of vertices of the 
graphs Hn, together with their types, form a generalized vertex space on the 
generalized ground space (Z+j/Lt). Taking A — Z+ in (|5ip. we have in particular 
that \Hn\/n A /^(Z^) G (0,oo). By the function k forms an integrable 
kernel on the ground space with integral 20. Finally, the technical 

'graphicality' condition of [7] is met since Fg has asymptotically On edges. It 
follows that under these assumptions, the results of [7] apply to Hn (see Remark 
2.4 there). The most important of these results is [71 Theorem 3.1], which tells 
us that Hn will whp contain a giant component (one with Q{n) vertices) if and 
only if IITkII > 1, where is a certain integral operator associated to k. In 
particular, if ||Tk|| > 1 then there is some constant a = a{K, fi) > (anything 
smaller than the quantity p{k) in [7j) such that whp Hn has a component with at 
least an vertices. For the particular k considered here, which is 'rank 1', we have 
II^kII = X)fc26'fc^Aifc; see (16.8) in [7]. Note that if Hn contains a component 
with at least an vertices, then so does Fg - the union of the components of 
F corresponding to these vertices of Hn- So, in short, if (|48p and (|49p hold, 
then Fg will have a giant component (whp) if (and, one can check, only if) 
J2k 20k'^iik > 1- Moreover, it is not hard to check that these conclusions remain 
true if we delete some subset of the components of Fn, and adjust /z, as long as 
(|48|) holds for the new graph and Q{n) components remain; this is because l|5ip 
still holds, and the kernel is still graphical. 

We shall apply the observations above with initial graph F — Fn — G^„, 
where TZ is some vertex size rule. By ([5|), the condition (|48|) holds with pk = 
Pk{th)/k. Furthermore, as noted after ([5]), we have J2kPk(tt,) — 1, which gives 
pg)) . Finally, note that 

llr.ll ^J2^0kpkit^) = 20s{t^) = c^, (52) 

since s{t) = X]fc Pfe(^) diverges at i = tb- So far this tells us only that if we run 
any size rule up to time t = and then switch to adding uniformly random 
edges, after any constant times n further edges a giant component will emerge. 
The key point is that variants of this argument can be used to study the further 
evolution of Gf- for suitable rules TZ. A related approach was taken in [21] 
and [12]. 

Theorem 15. Let TZ be a bounded-size £-vertex rule. Then the conclusion of 
Conjecture\M holds for TZ; in particular, t^ = t^ , and moreover for any e > 
there is an a > such that whp ii(G^^^^jjJ ^ an. 

Note that this result was proved for some bounded-size 4-vertex rules (ones 
in which either viV2 or V3V4 is added) already by Spencer and Wormald |21| . 

Proof. By definition of bounded-size rules, there is a constant B such that TZ 
treats all components of size greater than B in the same way. Consider the graph 
generated by the rule after t^n steps. Let W be the set of vertices of this 
graph in components of size greater than B, and let F ~ Fn be the subgraph 
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of Gj^n induced by W. Noting that s{th) = ^k^Pk{th) = oo > _B, we have 
Pk{th) > for some k > B, and it fohows that for some constant /3 > 0, we have 
\W\ ^ f3n whp. From now on we assume that this is the case. In ah subsequent 
steps of our original process G^, every vertex of is in a component of size 
greater than B. Fix e > 0. Let us cah a step good if in this step all £ selected 
vertices are in W. Then each step is good with probability at least /3^, and it 
follows that whp at least 9n of the next en steps are good, where 9 = e/?^/2 is 
a positive constant. Again using the definition of a bounded-size rule, in each 
good step at least one edge is added and by symmetry it is chosen uniformly at 
random from all possible edges with ends in M^. It follows that we may couple 
^(tb+6)n ^^'^ so that whp the former contains the latter as a subgraph. But 
F satisfies the assumptions above with /x^ — pk{th)/k ioi k > B and fik — 
for k ^ B. Since the sum in ([52|) remains infinite after removing the first B 
terms, Theorem 3.1 of 7 and the discussion above imply that for some positive 
a, whp Fg contains a component with at least an vertices. □ 

Our next result concerns a different generalization of the Bohman-Frieze 
process [4]. Let us call an Achlioptas rule TZ take-it- or-leave-it if, when presented 
with a choice of two edges ei and 62, the rule decides which to select depending 
only on the current graph and on ei. In other words, the rule first sees ei and 
must decide whether to take this edge of not; if not, it selects the uniformly 
random edge 62- Bounded-size rules of this type were studied, for example, by 
Bohman and Kravitz [6]; here we do not assume that the rule is bounded-size. 

Theorem 16. Let TZ be a take-it- or-leave-it size rule. Then the conclusions of 
Conjecture and Theorem \15\ hold for TZ. 

Proof. Consider the process (G^„^j)i^Oj i-e., our Achlioptas process started at 
step thn. As above, set F = F„ = G^„. Since 7?. is a take-it-or-leave-it rule, the 
further evolution may be described as follows. Let £1 and £2 be independent 
lists of independent (potential) edges each chosen uniformly at random from all 
(2) possibilities. In step i of our process (step t^n -\- i of the original), take for 
ei the ith element of £1. The rule now decides whether to add this edge to the 
current graph. If not, take for 62 the next edge from £2, and add that. Thus, 
the jth time that the rule declines the first edge, we take the jth edge from £2. 

Since the edges in £2 are uniformly random, the discussion before Theo- 
rem [15] shows that for any constant S > 0, whp the first Sn edges from £2 will, 
when added to F = G^„, be enough to form a giant component. Fix e > 0, 
and define as above a graph whose vertices are the components of F, with 
edges corresponding to the first en edges from £1. As noted before, this graph 
Hn may be viewed as an instance of the model studied in [7] , and there is some 
a > such that whp Hn has a component with at least an vertices. Further- 
more, by the stability result [71 Theorem 3.9], there is some S > such that 
whp Hn has the property that deleting any Sn edges still leaves a component 
with at least an/2 vertices of if„. Hence, whp £1 has the property that if we 
add any subset of at least (e — d)n of the first en edges to F, we will create a 
component of size at least an/2, and whp £2 has the property that adding its 
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first 5n edges to F creates a component of size at least some constant times n. 
But wlien both properties liold, tlien wliatever the rule does, G^^^^^^^^ will have 
a giant component. □ 

For our final result, let us call an Achlioptas rule large-biased if there exists 
some constant B such that if both endvertices of ei are isolated vertices (com- 
ponents of size one) and both endvertices of 62 are in components of size greater 
than B, then the rule will select 62. Perhaps the most interesting examples of 
such rules are the reverse product rule, where we select the (a if there is a tie) 
edge maximizing the product of the sizes of the components containing its end- 
vertices, or the reverse sum rule, defined similarly but with product replaced by 
sum. Perhaps surprisingly (given the difficulty of analyzing the usual product 
rule), we can prove Conjectured] for such rules. 

Theorem 17. Let TZ be a large-biased size rule. Then the conclusions of Con- 
jecture\^and Theorem \15\ hold for TZ. 

Proof. The proof is very similar to that of Theorem 1151 Indeed, as usual we 
start from F — G^„. As before, let W be the set of vertices of F in components 
of size greater than B. Call a subsequent step good if ei joins two vertices in 
components of size one and 62 joins two vertices in W . Since there are whp at 
least some constant times n isolated vertices in G^^_^-^j„, and (as before), W 
whp has size at least a constant times n, off an event of small probability the 
conditional probability (given the history) that the next step is good is always 
at least some positive constant. Furthermore, when a step is good, the added 
edge is uniformly random among all possible edges inside W . The remainder of 
the argument is as for Theorem [151 we omit the details. □ 

The results above all illustrate the idea that if we can find a reasonable 
number of uniformly random edges among the edges selected by our process, 
then the process will be 'well behaved' (will have tc = ib)- This approach can 
be used to prove Conjecture [2] for other special classes of size rules, but it seems 
that additional ideas are needed for the general case. 

3.2 Examples of delayed percolation 

Having given several partial results supporting our belief in Conjecture [21 in this 
section we show that the conjecture cannot be extended to arbitrary ^-vertex 
rules. More concretely, we give examples of simple rules that can delay the 
appearance of linear size components for Q(n) steps beyond the point where the 
susceptibility diverges. The rules we use behave like size rules almost all the 
time. 

We start by introducing the r-sum rule Sr, which is a 27™- vertex size rule. 
Given vertices {vi, . . . , vg) and the corresponding list of component sizes (ci, . . . , q), 
the r-sum rule adds the pair V2j-iV2j with the (smallest, if there are ties) j £ [r] 
that minimizes the sum C2j-i -\- C2j of the component sizes. Recall the defini- 
tion of F^ given in Section [21 informally it denotes the graph that we obtain 



28 



by starting with the initial graph Fj^ = F and then fohowing i steps of an 
Achhoptas process using the rule TZ (to decide which edges to add in each step). 
Intuitively, the next lemma states that the r-sum rule does not substantially 
change (uniform) polynomial tails for N-^k during some Sn steps (here we use 
Sr for concreteness; other size rules exhibit similar behaviour). 

Lemma 18. Let F he a graph on n vertices. Suppose there are x,C > Q and 
K — K{n) ^ 1 such that for all 1 ^ k ^ K we have 

N^k{F) ^ Ck-^'n. (53) 

Given r ^ 1 + 1/x there exists S — (5(x, C, r) > such if n is large enough then, 
with probability at least 1 ~ n^^^ , for all 1 ^ k ^ K' = minji^T, n^/I^'^^+^^l} we 
have 

N>k{Fin ) ^ 2Ck--n. (54) 
Proof Set 6 = 2-[(2+x)r+3](--(r-i)^ l^^- denote the event that for all 

^ i ^ i' and 1 ^ fc ^ fc' we have 

N^kiFf^) ^ 2Cfc-"n. (55) 

Observe that it suffices to show that £sn,K' fails with probability at most n~^^. 
For k ^ K' let Xk^i denote the indicator function of the event N^k{Ff'') ^ 
N^k{Ffz{). Set Xk = Ei<,<5„^fc.» and Yk = Ei<»<5„ Yk,^, where 




if holds, 
otherwise. 



Note that in each step a new component of size at least k is only created by 
Sr if in each pair V2j-iV2j at least one vertex is in a component of size at least 
\k/2\ . So, whenever fi„i_fe„i holds, using ([55]) and r ^ 1 + we see that the 
probability that Xk,i = 1 is at most 

( 2N^^,/,^{Ffz,) \ ^ ( AC V ^ (2^+-C)- ^ (2^+-C)- ^ 
y n j ^\{k/2Y) fc" ~" fci+^ 

Since Y^^i = whenever Ei^i^k-i fails, it follows that Y^ is stochastically 
dominated by a binomial random variable with 5n trials and success prob- 
ability £,k- Note that (|53l) implies C ^ 1. Now, using k ^ K' we have 
Sn^k > C/8 • ^1/2 ^ 600 logn for n ^ no, so standard ChernofF bounds yield 

F{Yk ^ 2Sn^k) < e-^"^"/^ < n-^°°. (56) 

Next we claim that Esn,k-i and Y^ < 2dn£,k together imply £sn,k, so that 
f{-^£sn,k) ^ ¥{^£sn,k-i) +^{Yk ^ 25n£_k)- Indeed, by monotonicity £5n,k-i 
implies £i.k-i for every ^ i ^ 5n, so Yj. — X^- Now, since N^k increases by 
at most 2k per step, by choice of S it follows that 

N^kiFf;;) ~ N^kiF^n ^ 2kYk s: 4,5 {2^+^Cyk-^n Cfc-^n, 
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which together with N;^k{F^^) = N^k{F) ^ Cfc-^n implies N^k{Ff;;^) < 
2Ck~^n, as claimed. Iterating the above argument for k ^ K' and noting 
that £sn,i always holds due to C > 1, using ((55)) we obtain 

and the proof is complete. □ 

Let Vr denote the rule which always adds the pair viV2 during the first 
n/2 steps (corresponding to an Erdos-Renyi evolution with i — 2); afterwards 
it 'switches' and uses the r-sum rule Sr- The point is that many properties 
of the 'critical' Erdos-Renyi random graph G„ „/2 £^re well known: there exist 
constants C,a > and a function K = K{n) with ii' — > oo as n oo such 
that whp S'(G„,„/2) ^ and N^k{Gn,n/2) ^ Ck-^'^n for SiWl^k^K. So, 
by conditioning on these properties and then using Lemma [TSl we immediately 
deduce the main result of this section. Indeed, using the rule 1?^ for r 3 we 
whp have diverging susceptibility after n/2 steps, but in 5n subsequent steps 
whp no linear size components appear (in fact, in this case % — \/2 < holds). 

Corollary 19. For every r ^ 3 there exists 6 — S{r) > such that we have whp 

Alternatively, using essentially the same line of reasoning, we obtain a similar 
result by switching after the first step where the susceptibility is at least L = 
L{n) = 1^(1), for L not too large. Furthermore, we can replace by other 
suitable size rules. For example, the rule M.i^ which always connects two vertices 
with the two smallest component sizes Cj , satisfies an analogue of Lemma [18] 
for £ ^ 2 + Xjx. So the rule Cf, which switches from an Erdos-Renyi evolution 
(always adding v\V2) to M.i after n/2 steps, yields another example with t\^ < tc- 

Corollary 20. For every £ ^ A there exists 6 — d{i) > such that we have whp 
SiG^,) = ^(1) and ii(Gf//2+,J = o{n). 

Note that the examples given in Corollary [19] and [20] always behave like size 
rules except that once between two steps they change the rule used (by only 
querying natural parameters such as the number of vertices and steps, or the 
susceptibility). So, one can argue that Conjecture [2] already fails for a rather 
restricted superset of size rules. 
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A Appendix 

In this appendix we show that, as long as an Achlioptas process contains only 
small components, it will have a very simple cycle structure: most components 
will be trees, some will be unicyclic, and there will (whp) be no 'complex' 
components, i.e., ones containing more than one cycle. In fact, we prove the 
result for the more general class of ^-vertex rules. However, here we need an 
additional assumption: in each round the set of added edges is a forest. Allowing 
slightly greater generality, we call a rule acyclic if the edges between vi, . . . ,Vi 
added in a single step correspond to a forest on 1, . . . , ^. This in particular holds 
if in each stcip at most two edges arc added. Note that such an assumption is in 
fact necessary for £ ^ 3, since always connecting all vertices in each step quickly 
creates many cycles and complex components. 

Lemma 21. Let £^2 and let TZ be an acyclic £-vertex rule. Given < 5 < 1/4 
and U = U{n), suppose that for n ^ no{£,S) we have 1 < < n^^'^~^ . For 
n > no{l,5), with probability at least 1 — n~^/'^ the following holds for every 
< z < n^^^ : in Gf- there are no components of size at most U which contain 
at least two cycles, and the nvmber of vertices in components of size at most U 
with exactly one cycle is at most U^n^^ . 

Proof. Set m = n^^^ . Let Bi^i denote the event that in step i one of the 
following happens: (a) at least three of the £ randomly chosen vertices are 
contained in the same component of size at most U, or (b) there are randomly 
chosen vertices u)i.u)2.u):^,Wi and two components Ci,C2 of size at most U 
such that wi,W2 G C'l and ?iA3,W4 G C2. It is easy to see that Bi^i holds with 
probability at most ^^C/^/n^ + £^17"^ /n"^. So, denoting by Bi the event that Bi,, 
holds for some i < m, we see that 

P(^Bi) < m • 2e^U^/n^ < 2£%-^/^-^. 

Let Si^u denote the set of vertices of which are in components of size 

at most U containing exactly one cycle, and let B2 be the event that Si^u 
contains at least 2£^Um/n components for some i ^ m. Then -1B2 implies 
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\Si,u\ ^ 2£^U'^m/n = R for every i ^ m, where R < U'^-n?^ for n ^ no(^,(5). 
Since in each step the number of components in Si^jj changes by at most when 
B2 holds there are at least 21'^Um/n steps in which the number of components in 
Si^u increases. This can only happen if at least two randomly chosen vertices are 
in the same component of size at most U . Since in each step this has probability 
at most f'U jn, the expected number of such steps is bounded by A = PUm/n. 
Using standard Chernoff bounds (and stochastic domination) it follows that 

Let denote the event that in some with i ^ m there exists a compo- 
nent of size at most U which contains at least two cycles. In each step where 
Bi^i fails, note that a new complex component of size at most U can only be 
created if (a) at least two randomly chosen vertices are in Si-i^u or (b) one 
randomly chosen vertex lies in Si-i^jj, and two randomly chosen vertices are in 
the same tree component of size at most U. So, by considering the probability 
that this happens for some i ^ m (assuming |S'i_i^(7| ^ R), we see that 

¥{^Bi n -62 n s$ m • {fR^/n^ + fRU/n^) s$ 5^^C/^mV«^ ^ bi^n'^ 

completing the proof for n ^ no{S,£). □ 

Theorem [T] tells us that for size rules, for any fixed t < the largest 
component of whp has size at most O(logri). Taking U — (logn)^, say, we 
see that if TZ is acyclic, then whp there are no complex components, and at most 
7jo(i) vertices on cyclic components - in other words, almost all components are 
trees. (We have not tried to optimize the bound here - the method actually 
gives some power of logn.) 
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